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Abstract 

Several methods to extract an asymmetry parameter in an event distribution func- 
tion are discussed and compared in terms of statistical precision and applicability. 
These methods are: simple counting rate asymmetries, event weighting procedures 
and the unbinned extended maximum likelihood method. It is known that weight- 
ing methods reach the same figure of merit (FOM) as the likelihood method in the 
limit of vanishing asymmetries. This article presents an improved weighting proce- 
dure reaching the FOM of the likelihood method for arbitrary asymmetries. Cases 
where the maximum likelihood method is not applicable are also discussed. 
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1 Introduction 



We consider two differential event distributions n^{x) following the functional 
form 

n^{x) = a{x){l±l3{x)A). (1) 



In a typical experimental situation encountered in particle physics a{x) in- 
cludes a flux and acceptance factor and is an analyzing power. Both 
depend on a set of kinematic variables here denoted by x. Concrete examples 
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are spin cross section asymmetries and muon decay. In the latter case the 
asymmetry corresponds to the muon polarization. The two data sets (+ and 
— ) are for example obtained by changing the sign of a polarization. The goal is 
to extract the parameter A by measuring the event distributions n^{x). This 
would be an easy task if both a{x) and (3{x) were known. However in many 
applications the factor a{x) is not known, or not accurately enough known 
and only (]{x) is given. 

Section [2] presents various methods to extract the parameter A: The simplest 
method, based on counting rate asymmetries, the more efficient extended un- 
binned maximum likelihood (EML) method and finally methods based on 
event weighting are discussed. While in Section [2] we assume that the factor 
a{x) is the same for both data sets. Section [3] extends the discussion to the 
case where one has two different factors. Up to Section [3] we assume that the 
number of observed events is large enough so that averages can be replaced by 
the corresponding expectation values. Effects occurring at low statistics event 
samples are discussed in Section |H A summary and conclusions are given in 
Section [51 



2 Different Methods to extract A 

2. 1 Determining A from counting rate asymmetry 

The expectation value of the number of events for the two data sets reads 
^iV±^ = j n^(x)dx = (1 ± A) J a{x)dx (2) 



with (/?) = / a(3dx/ J adx. The integrals run over the kinematic range of x. 
The asymmetry A can be extracted without the knowledge of / a{x)dx: 

A=±i^n^s^ (3) 



Eq. (I3l) leads to following estimator for A: 

7V+ + A^- - N~ N+ - N- 



A 



cnt 



where /3i = A^"^ and A^ are the numbers of observed events. The sums 

E+ and E- run over all events in the corresponding data set (+ or — ). 
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As shown in Section I2l3] and App.[Xl the figure of merit (FOM), i.e. the inverse 
of the variance on A^nt, is 

FOM = A^ — (4) 



where A^ = A^+ + A^ denotes the total number of events. 

This figure of merit may be increased if a cut is set to remove some data with 
low values of (3. However, it will not reach the FOM of the unbinned extended 
likelihood method discussed now, unless (3{x) is constant. 

2.2 Extended Maximum Likelihood (EML) Method 

We now turn to the unbinned extended maximum likelihood (EML) method [T|2] 
which is known to reach the Cramer- Rao limit of the lowest possible statistical 
error. The log-likelihood function derived from Eq. ([1]) reads 

/ = ^ In (a,(l + I3,A)) - (A^+) {A) + ^ In - A^)) - {N-){A) . 
+ 

Using the expression in Eq. ([2]) for the expectation values (A^^) results in 



/ = ^ln(l + AA) + ^ln(l-/3iA)-2 / a(x)dx - ^ Ina^ . (5) 
+ - +- 

The last two terms do not depend on A and can be ignored in the likelihood 
maximization. The asymmetry A can thus be determined without knowledge 
of a{x). For small values of [3 A one can even derive an analytic expression for 
Alh which reads 



For arbitrary asymmetries the maximization has to be done numerically. Note, 
that this requires CPU intensive sums over all events in the maximization 
procedure. 

The figure of merit (FOM) is given by 



^(1 + AA)2 ^{l-P.Af 
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Replacing the sum over events by integrals one finds 



FOM^^^ = / -^^ „ dx + / „ .'^ dx 



[l + (3Ay J (1 -/3A)2 



da; . (7) 



1 - 

Noting, that for an arbitrary function f{x) the average is defined by 
/ f{x){n'^{x) + n~ {x))dx J a{x)f{x)dx 



if) 



J n+(x) + n {x)dx J a{x)dx 



the figure of merit can be written as 

FOM,„ = A'(^^). (8) 

2.3 Weighting Method 

Next, consider the following estimator 

where Wi = w{xi) is a, for the moment arbitrary, weight factor assigned to 
every event. The expectation value of equals A independently of the weight 
function w{x) used. App. [Al shows that 

FOM^ =N-—p^^\——. (10) 
Two cases are of interest: 

1. ) Setting w = 1 corresponds to the counting rate asymmetry discussed in 
Section O and proves Eq. (g]) for the FOM. 

2. ) In the case w = (3 the FOM is 



FOM^ ^ = 



w=f3 



1 _ ^2M' 



A comparison with Eq. ([8|) indicates that FOM j coincides with the FOM 
of the likelihood method for vanishing A. Actually, in this case, the two es- 



4 



timators are identical as can be seen by comparing Eq. ([9]) with w = (3 and 
Eq. ([6]). Note that the estimator can be apphed for arbitrary asymmetries 
as well, accepting a decrease of the FOM compared the EML estimator as 
discussed in Section 1^751 

Such a weighting procedure has been used for example in Refs. pil] to extract 
spin asymmetries in the case where A <C 1. In Ref. [5] a weighting method 
is discussed to simultaneously extract signal and background asymmetries. 
The fact that a weighting procedure reaches the same FOM as the EML 
method was first discussed in Ref. [7] in the context of signal and background 
extractions. The next section shows that one can find a weight factor reaching 
the FOM of the EML method even in the case of non- vanishing asymmetries. 



2.4 Improved Weighting Method 



Variational calculus shows (s. App. |B]) that the maximum FOM is reached 
using a weight factor 



Here, Aq is a first estimate of the asymmetry A obtained for example from 
the weighting method presented in Section 12.31 The weighting factor defined 
in Eq. ( fTTl) leads to the following estimator (the index iw stands for improved 
weight) 



Mi 


-E- 






+ E- 









— ffL fl2 • l-^-^J 



Eq. ([ID]) gives 



FOM J =iV f s • (13) 

1 P n _/q2 A'i\2 



(1-/32^2)2 

Thus given a good estimate Aq ^ A, we get FOM^.^ = FOM^^^, i.e. the 
improved weighting method reaches the same FOM as the EML method for 
arbitrary asymmetries as well with the advantage that no CPU consuming 
maximization procedure is needed. 
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Counting rate 
asymmetry 



weighting 
method 



Likehhood 
method 



Improved weighting 
method 



Figure of merit 

N 



Table 1 

The ratio FOM/A^ for various methods discussed. 

Before we move to a comparison of the different methods, we note that the 
estimator 



reaches as well the FOM of the EML for A ^ Aq, In contrast to Ai^j its 
expectation value only equals A ii Aq ^ A. For r decays the optimal weight 
factor in Eq. (fill) is discussed in Ref. [B]. 



2. 5 Comparison of different methods 



Tab. [T] summarizes the FOM of the various estimators proposed. Fig. [T] shows 
the figure of merit of the different estimators vs. A for the choice 



a{x) = const. = 2500 and (3{x) = x, < x < 1 . (15) 

The curves are analytic calculations. The points are results of simulations. 
For each value of the asymmetry 10000 configurations with a = 2500, which 
corresponds on average to 5000 events, were simulated. One configuration 
consists of a plus and minus data set used to evaluate an asymmetry. For each 
of the 10000 configurations simulated, the asymmetries were calculated using 
the estimators discussed above. The FOM was determined from the RMS of 
the asymmetry distributions. 

The results are in perfect agreement with the analytic calculations. The sta- 
tistical errors of the simulations are of the order of the size of the points. 
Note, that for all methods no bias was found for the asymmetry. The question 
of bias and the range of validity of the expressions for the FOM will be dis- 
cussed in more detail in Section HI The weighting methods are superior to the 
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Fig. 1. Figure of merit for different methods as a function of the asymmetry A. 

method using simply the counting rates. As expected the improved weighting 
or the EML method reach a higher FOM than the simple weighting method 
the larger the asymmetry A. The results depend of course on the shape of 
a{x) and The gain in FOM using weighted events compared to counting 
rates depends on the spread of For A = for example it is / 

as can be derived from Eq. ( fTOl) . 



Fig. [2] shows the influence of the choice of Aq on the FOM in the improved 
weighting method for the factors a and /3 as given in Eq. ( lT5l) and an asym- 
metry A = 0.8. Choosing Aq in a range 0.7-0.86, one reaches at least 99% of 
FOM J . The normal weighting method corresponds to Aq = 0. 



3 Different acceptance/flux factor in the two data sets 



We now turn to the case where the acceptance and flux factor a is not the 
same in the two data sets. We assume that they differ by a known factor c 
which is independent of x. In this case the differential event distributions are 
given by 



n~^{x) 



2c 



a{x){l + (3{x)A) and 



1 + c 
2 

n~{x) = a(x)(l - f3{x)A) 



(16) 
(17) 
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Fig. 2. FOM J /FOM 1 for A = 0.8 as a function of Aq. 

The factor 2/(1 +c) has been introduced in order to normahze the distributions 
to the same number of events for all values of c at A = 0. The log likelihood 
function reads 



^ = E In «,(1 + AA)) - (iV+) {A) + 



^In (^-_a,(l-A^)j-(iV-)(A) 
:^ln(l + A^) + Eln(l-A^) 



2c 



+ In ai + In — 

4" 1 + c ^ 1 



-2 / adx - 2A — -faPdx. (18) 
J 1 + c J 

Here the last term cannot be ignored because it contains the parameter A and 
thus the likelihood method cannot be applied without knowledge of the factor 
/ aPdx. The weighting method on the other hand can be applied with a small 
modification: 



A 



w,c 



(19) 



The expectation value of Ay^^c equals again A. The figure of merit reads (deriva- 
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Fig. 3. Figure of merit for three different values of c as a function of the asymmetry 
A, assuming the same number of events in the case ^ = 0. 

tion see s. App. ICl) 



FOM 



A.U 



N- 



4c 



(l - 



(20) 



The FOM is shown in Fig. [3] for different values of c for the improved weighting 
method. As in Fig. [1] the lines correspond to an analytic calculation using 
Eq. (120]) . the points are results of simulations. Note, that for arbitrary c the 
weight reaching the highest FOM is 

^ 

W = 7 ^ . 

(1 - /3M2) (l - (3A'j^) 



The EML method could be used for c ^ 1, if one uses an estimate for / a/5dx ~ 
J2+f3{xi) + cJ2- /^i^i) from data. Simulations showed that the FOM of this 
modified EML method equals the one of the improved weighting method. 



4 Validity at low number of events 



In this section we discuss the validity of the equations derived for the various 
FOMs and possible biases of the estimators. The formulas were derived us- 
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Fig. 4. Combining data in different configurations. The boxes denote tlie plus and 
minus data sets. 

ing usual error propagation and can thus only be approximations which are 
the better the higher A^. In the simulations presented in Section 12.51 for every 
value of the asymmetry 10000 configurations were simulated with on average 
5000 events (corresponding to a = 2500 in Eq. f|T5|) ). In each of these config- 
urations the asymmetry was determined using the various estimators. In this 
section we discuss effects occurring if the asymmetries are extracted in smaller 
configurations, as indicated in Fig. IH 

For lower number of events in one configuration one reaches a point where due 
to statistical fluctuations the estimated asymmetry in a given configuration 
can be larger than 1 or smaller than —1. In this case the EML method is no 
more applicable since the term (1 ± I3A) can get negative. For a = 250 this 
happens in about 1% of the configurations for an asymmetry of A = 0.8. 

Dividing the sample further in many smaller configurations one reaches a 
point where one has only or 1 event in one configuration. This limit can 
also be obtained by dividing the sample in many narrow bins of /3 having at 
most one event in a bin. In this case the remaining estimators are identical: 
Acnt = ^w=i3 = -^iw = ±l//3i- The sign depends whether the event occurred 
in the plus or the minus data set. One finds = A and {^A^J^ = l^f-, 

thus the FOM for this event reads 



FOM, 



Aj, 



/I,;, 



1 - PIA^ 



Note that A is not the estimated asymmetry from a single event but rather 
taken from a larger event sample. Combining all the asymmetries determined 
on single events leads to 



(21) 
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Assuming A = Aq, Eq. ( 12T|) is nothing but the estimator of the improved 
weighting method, Ai^, defined in Eq. ( !T2|) . In other words, for the improved 
weighting method it makes no differences whether the data are analyzed in 
one large configuration or in many small ones. The improved weighting is 
also equivalent to using an infinite number of bins in /3 and evaluating the 
asymmetries in every bin and then combining the results. The advantage of 
the improved weighting method is that this binning has not to be performed. 

The observations discussed above are confirmed by simulations. In total 10^ 
configurations with a = 0.025 were simulated. The simulated data were ana- 
lyzed as follows. First the asymmetries were calculated in the approximately 
5% of the configurations actually containing at least one event. Then the 
weighted average of these asymmetries is calculated. The same data were an- 
alyzed in a different way by combining 10 configurations and calculating the 
asymmetries in these larger configurations corresponding to a = 0.25. This 
procedure was repeated until reaching 10^ configurations with a = 2500. Fig.H] 
illustrates the procedure. These simulations were performed for an asymmetry 
of 0.8 generating events in the range 0.01 < (5{x) < 0.99 . 

Fig. E] shows the mean value of the asymmetries and the statistical error for 
the various estimators for the different values of a. As expected the estimator 
Aiy, gives the same result independent of a. No bias is observed within the 
statistical error which is of the order of 10~^. The asymmetry for the EML 
method is only shown for a = 2500 since at lower values, as explained above, 
the EML method is no more applicable. 

Fig. [6] shows the distribution of the asymmetry for different values of a. 
The entries in the histograms in Fig. [6] are weighted by their corresponding 
FOM. In the case a = 2500 this would not be necessary because all entries 
have essentially the same FOM for a given method since the relative variation 
of the number of events and the averages like X]± Pf entering the FOM 
vary only very little from configuration to configuration. At lower values of a, 
however, this is no more the case. Taking again the extreme case where the 
asymmetries is calculated from single events, the FOM depends on the value 
of /3 for this event. This explains why the number of entries is smaller than 
1 in some bins of the histograms. At a = 0.025 the number of configurations 
is 10^. The corresponding histogram has only approximately 4.9 ■ 10'' entries 
refiecting the fact that in most of the configuration there is no event. 

Finally, Fig. [7] shows the FOM/N calculated from the RMS of the asymmetry 
distributions presented in Fig. [H] for different values of a. The three lines 
correspond to FOM^^^/N, FOM^^^^A^ and FOM^^^^A^ calculated using the 
expressions given in Tab. [H For a > 25 there is good agreement with the 
FOM derived in Section [2] since the points coincide with the corresponding 
lines. At lower values of a the FOM of the weighting and the counting rate 
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Fig. 5. Results for the asymmetry of the simulations as a function of 
the average number of events in one configuration. The points are at 
a = 0.025,0.25,2.5,25,250,2500, respectively. For a given value of a they are 
slightly displaced on the horizontal axis for better readability. Note, that values 
at different values of a are correlated since the same data were used. 

method start to increase and finally reach as expected FOM^, at a = 0.025, 
where the three estimators are practically identical. 



5 Summary Sz Conclusions 



We presented several estimators to extract an asymmetry parameter A in 
a number density function. These estimators were based on counting rates, 
event weighting and the unbinned extended maximum likelihood method. A 
weighting procedure was derived that reaches the same figure of merit as the 
unbinned maximum likelihood method, known to reach the minimal variance 
bound. This weighting estimator is given as an analytic expression, whereas in 
the EML method the maximization of the likelihood function has to be done 
numerically. Moreover this estimator can be used (with a small modification) 
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Fig. 6. Distributions of the estimated asymmetries Ai^j for different values of a. 
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Fig. 7. Figure of merit per event for different methods as a function of a for an 
asymmetry A = 0.8. The lines show the expectation calculated from the expressions 
given in Tab. [H 
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in cases where the EML method cannot be apphed because of an incomplete 
knowledge of event distribution function. 

Acknowledgments: I am grateful to Jean-Marc Le Goff for numerous discus- 
sions on the subject, verifying the calculations and for carefully reading the 
manuscript. 
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A Figure of merit of 



To calculate the FOM of defined in Eq. ([9]), one needs <J^{Y1 Wi), cr'^iYl Wi/3i) 
and cov(J2 'Wi,J2wif3i). For two arbitrary quantities / and g the covariance 
between J2 fi and 9i is 



* 3 

= (E f^9^ + E /^^?.) - (E (E^?.) (A.2) 

= (N) {fg) + {N{N - 1)) (/) {g) - {Nf if) {g) (A.3) 

= (N) (fg) + {{n') - (N) - (Nf) if) (g) . (A.4) 

If the number of events N is Poisson distributed, i.e. (A^) — (A) — (A)^ = 0, 
one finds 

cov(E E 93) = (N) (fg) ^ E f^9^ . (A.5) 



Setting f = g = w,f = g = wf3 and f = w,g = w(3 results in 



AT.^^) = {N) {w') , (A.6) 
a\Y.^A) = {N) {{w^f) , (AT) 
cov(E w,, E wA) = (N) lw^(3) . (A.8) 



Simple error propagation in Eq. ([9]) finally leads to Eq. ffTOl) for the figure of 
merit. 



B Optimal weight 

Denoting the weight factor which maximizes the FOM by wq, we consider 
small deviation from this optimum by 

w{x) = Wq{x) + erj{x) (B.l) 
where ri{x) is arbitrary and e <^1. 
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Inserting Eq. (1B.1|) in Eq. ( ITOj) keeping terms of 1st order in e one finds 



FOM 



(K + 2eu;or/)(l-/3M2)) ' 



(B.2) 



The condition dYOVi/de = gives 

/3 



Wo 



1 - P^A^ 



C FOM for the case c ^ 1 



The error for the estimator defined in Eq. (fT9|) is obtained by simple error 
propagation taking into account the correlations between J2w(3 and J^u!. 



with 



and 



dA, 



^ :i, -c, -A, -cA) 



(C.l) 



" E+<A 
E-w? E-W7-A 
E+«^'/3i E+(w^.A)' 
V E-«^-A T.^{wAf I 

This leads to Eq. ([20]). 



(C.2) 
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